Spring 2009 CSC 466 : Knowledge Discovery from Data

Author

  • Alexander Dekhtyar
Abstract

Definitions

Information Retrieval (IR). The process of finding documents in a given document collection that are relevant to the user's query.

Document collections. The key assumption of IR is that document collections are large. Note: this is not always the case. Some specialized uses of IR techniques involve document collections on the order of tens or hundreds of documents, not the hundreds of thousands or millions usual in traditional IR settings.

Queries. A user query is a formal representation of an information need. The two are not the same. An information need is an articulated desire to obtain certain information (or to find documents that contain certain information). A query is a representation of an information need that can be processed by a specific Information Retrieval system.

Example. The user's information need is to find all information about the University of Alabama's 1992 football season and, in particular, the list of all games. The user's query to a search engine (Google) can be: "University of Alabama Crimson Tide football 1992 season games schedule". (Given this query, Google returns the Wikipedia page on the 1992 Alabama football season, with the list of games, as the top result.)
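The distinction between an information need and a query can be illustrated with a minimal sketch. It assumes a toy in-memory collection and a simple term-overlap score; the function names `tokenize` and `rank`, the document identifiers, and the document texts are all hypothetical and are not taken from the course notes. Real IR systems use far richer models, so this only shows how a query, once written down as text, becomes something a system can process.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())

def rank(query, documents):
    """Rank documents by how many distinct query terms they contain.

    Returns (doc_id, score) pairs with score > 0, best match first.
    This is a toy term-overlap score, assumed here for illustration.
    """
    query_terms = set(tokenize(query))
    scores = []
    for doc_id, text in documents.items():
        score = len(query_terms & set(tokenize(text)))
        if score > 0:
            scores.append((doc_id, score))
    # Sort by descending score, then by doc_id for a deterministic order.
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))

# Hypothetical three-document collection (illustrative text only).
documents = {
    "d1": "Schedule of games for the 1992 Alabama Crimson Tide football season",
    "d2": "University of Alabama admissions and campus information",
    "d3": "Recipes for spring vegetables",
}

query = "University of Alabama Crimson Tide football 1992 season games schedule"
results = rank(query, documents)
print(results)
```

In this sketch the document about the 1992 schedule shares the most terms with the query and is ranked first, while the unrelated document is not returned at all. The information need itself (wanting the list of games) never appears in the code; only its textual representation, the query, does.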


Similar resources

Spring 2009 Csc 466: Knowledge Discovery from Data Alexander Dekhtyar Classification Methodology

} of attributes, and an additional categorical attribute C, which we call a class attribute or category attribute. The learning dataset is a relational table D. For each element of the dataset we are given its class label. The class labels of the records in D are not known. Classification Problem. Given a (training) dataset D, construct a classification/prediction function that correctly predi...


Cal Poly Csc 466 Knowledge Discovery in Data Web Structure Mining (and Associates)

Overview Terminology: • Link Analysis: analysis of graph structures. • Web Structure Mining: analysis of the web graph. • Social Network Analysis: analysis of graphs representing relationships between humans (social networks).


Cal Poly CSC 466 : Knowledge Discovery from Data

• Data mining: the techniques, methods and algorithms for finding patterns in structured data. • Data warehousing: the methods and techniques for managing data and processing complex analytical decision-support queries in databases. • Information Retrieval: the techniques, methods, algorithms and data models for finding information in unstructured (primarily, but not always, textual) data. • Co...


CD133 negative cancer stem cells in glioblastoma.

Glioblastomas (GBM) are paradigmatic for the investigation of cancer stem cells (CSC) in solid tumors. Recently, the discovery of CD133- CSC in addition to CD133+ CSC has substantially added to our understanding of the complexity of GBM CSC. This review gives an overview of our current knowledge of CD133- cells in GBM and describes five different hypotheses on the nature of CD133- cells in GB...


Knowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services

The rapid growth of information technology (IT) motivates and creates competitive advantages in the health care industry. Nowadays, many hospitals try to build successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction, and finally maximize their profitability. Many hospitals have large data warehouses containing customer ...



Publication date: 2009